Mon. Not. R. Astron. Soc. 000, 1-11 (2002) Printed 5 February 2008 (MN M^X style file v2.2) 



Cross-correlation studies as a probe of reionization physics 
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ABSTRACT 

The process of reionization is now believed to have proceeded in an orchestrated man- 
ner beginning with UV photons emitted by high redshift galaxies containing a large 
fraction of Population III stars carving out ionised regions around them. The physics 
during this era can be studied with a combination of redshifted 21-cm spin- flip transi- 
tion tracing neutral hydrogen gas, IR emission from massive primordial stars that trace 
the global star-formation rate during reionization, and the imprint of hot-electrons 
in first supernovae remnants Compton-cooling off of cosmic microwave background 
(CMB) radiation through the Sunyaev-Zel'dovich effect. While these individual ef- 
fects and their observable signatures have been advocated as probes of reionization 
history, here we show how cross-correlation studies between these signals can be used 
to further understand physics during reionization. Cross-correlation studies are ad- 
vantageous since the measurable statistics do not suffer in the same manner from 
foregrounds and systematic effects as is the case of auto-correlation function measure- 
ments. We discuss the prospects for detecting various cross-correlation statistics using 
present and next generation experiments and the information related to reionization 
captured by them. 

Key words: cosmology: theory — large scale structure — infrared: general — stars: 
formation — cosmology: observations — diffuse radiation 



1 INTRODUCTION 

The 21-cm spin-flip transition of neutral Hydrogen, either in 
the form of an absorption or an emission relative to Cosmic 
Microwave Background (CMB) blackbody spectrum, pro- 
vides one of the best ways to study the intergalactic medium 
during and prior to reionization (Scott & Rees 1990; Tozzi 
et al. 2000; Madau et al. 1997; Furlanetto et al. 2004; Santos 
& Cooray 2006). With frequency selection for observations, 
the 21-cm line, in principle, provides three-dimensional to- 
mography of the reionization era as well as a probe to the 
dark ages where no luminous sources are present. The ex- 
act physics associated with the reionization process is still 
largely unknown, though it is strongly believed that UV 
photons from first luminous sources are responsible for it 
(Barkana & Loeb 2001; Furlanetto et al. 2006). These UV 
photons create bubbles (Wyithe & Loeb 2003; Cen 2003a,b; 
Haiman & Holder 2003; Mackey et al. 2003; Santos et al. 
2003; Yoshida et al. 2003, 2004; Zahn et al. 2006) of ionised 
material around them, although it is still unclear whether 
densest or least-dense regions were ionised first (Furlanetto 
& Oh 2005; Cohn & Chang 2006). Direct detection of auto- 
correlation spectra of this signal might prove to be difficult, 
due to very strong foregrounds, although various cleaning 



techniques have been proposed (Zaldarriaga et al. 2004; San- 
tos et al. 2005; Morales et al. 2006) 

The same reionization redshifts can also be probed at 
near-IR wavelengths since the intensity of the cosmic near- 
infrared background (IRB) is a measure of the total light 
emitted by stars and galaxies in the Universe. The possi- 
bility that there is a high-redshift component to the IRB 
comes from the fact that the absolute background estimated 
by space-based experiments, such as the Diffuse Infrared 
Background Experiment (DIRBE; Hauser & Dwek (2001)) 
and the Infra-Red Telescope in Space (IRTS; Matsumoto 
et al. (2005)), is significantly larger that the background ac- 
counted for by resolved sources so far: only 13.5 ± 4.2 nW 
m~ 2 sr _1 is resolved to point sources at 1.25 /im (Cambresy 
et al. 2001), while current direct measurements range from 
25-70 nW m~ 2 sr _1 (see Kashlinsky (2006) for a recent re- 
view) . 

Primordial galaxies at redshifts 8 and higher, especially 
those involving Population III stars, are generally invoked 
to explain the missing IR flux between 1 /j,m and 2 /im, with 
most of the intensity associated with redshifted Lyman- a 
emission during reionization (Santos et al. 2002; Salvaterra 
& Ferrara 2003; Cooray & Yoshida 2004; Fernandez & Ko- 
matsu 2006). While models of high-redshift Pop III popula- 
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tions can explain the "missing" IRB, these models run into 
several difficulties if such sources were to account for all of 
the missing IR intensity. These include the high efficiency re- 
quired to convert baryons to stars in first galaxies (see e.g. 
Madau & Silk 2005) and limits from deep IR imaging data 
that suggest a lack of a large population of high-redshift 
dropouts (Salvaterra & Ferrara 2006). Still, one does expect 
some contribution to the IRB from sources that reionized 
the Universe, though the exact intensity of the IRB from 
such sources is yet unknown both theoretically and observa- 
tionally. 

As pointed out in Cooray et al. (2004) (see also Kash- 
linsky et al. 2004), if a high-redshift population contributes 
significantly to the IRB, then these sources are expected to 
leave a distinct signal in the anisotropy fluctuations of the 
near-IR intensity, when compared to the anisotropy spec- 
trum associated with low-redshift sources. Such a study has 
been attempted with anisotropy measurements using Spitzer 
data with preliminary indications for an excess anisotropy 
in the background at arcminute angular scales and below 
(Kashlinsky et al. 2005). There are, however, large uncer- 
tainties on the exact fraction of IRB intensity from red- 
shifts during reionization (Sullivan et al. 2006; Salvaterra 
et al. 2006), especially given the suggestions that these ex- 
cess IRB fluctuations can be partly explained with expected 
clustering from faint unresolved galaxies in optical images 
but not present in IR images (Cooray et al. 2006). Here, 
we suggest that an approach to establish the presence of 
IRB anisotropies from the era of reionization, such as those 
due to first-stars, will be to consider a cross-correlation of 
the IRB against brightness temperature fluctuations in the 
21-cm background. 

Finally, the small-scale cosmic microwave background 
(CMB) anisotropies are also expected to contain signa- 
tures from the reionization epoch (Santos et al. 2003; Zahn 
et al. 2005; McQuinn et al. 2006, 2005; Alvarez et al. 
2006). Uniquely identifying the fractional anisotropy con- 
tribution from reionization era is challenging with CMB 
data alone since a large number of sources contribute to 
the anisotropies of CMB at arcminute angular scales and 
below. These include thermal Sunyaev-Zel'dovich (SZ) ef- 
fect from low-redshift galaxy clusters, the Ostriker-Vishniac 
effect associated with bulk motions of the electron den- 
sity field, and gravitational lensing modifications to CMB 
anisotropies. In terms of reionization contributions, one 
expects an anisotropy when electrons in the surrounding 
medium of first supernovae explosions Compton-cool against 
the CMB (Oh et al. 2003); the modification to CMB is sim- 
ilar to the y-distortion related to SZ scattering in clusters. 
Such a signal need not be associated with supernovae alone, 
as X-rays from first quasars, supernovae, and star clusters 
are also expected to heat the intergalactic medium (IGM) 
at high redshifts (Oh 2001; Venkatesan et al. 2001). Such 
a high-redshift contribution could also help explain the ex- 
cess CMB anisotropy at arcminute angular scales, as seen by 
CBI (Mason et al. 2003), BIMA (Dawson et al. 2002; Daw- 
son et al. 2006), and ACBAR (Kuo et al. 2004; Kuo et al. 
2006) experiments. 

If a high-redshift SZ contribution exists, an interesting 
possibility to identify it involves the cross-correlation be- 
tween small-scale CMB anisotropy maps and the 21-cm sig- 
nal. If the IGM is heated to a temperature above the CMB, 



as due to X-ray heating, then the 21-cm signal will be de- 
tectable as an emission. While SZ originates from the ionized 
electrons and the 21-cm signal is related to the neutral Hy- 
drogen distribution, with partial reionization, one expects 
the two signals to spatially anti-correlate as the regions con- 
taining free-electrons trace the same underlying density field 
defined by dark matter that also leads to fluctuations in the 
neutral Hydrogen. The cross-correlation between arcminute- 
scale CMB and the 21-cm can be used to measure the angu- 
lar power spectrum of this (anti)-correlation and help estab- 
lish the fraction of SZ effect originating from high-redshifts. 

In this paper, we discuss several cross-correlations in- 
volving 21-cm data and tracers of reionization discussed 
above. Since 21-cm observations can be preselected based on 
the choice of a frequency bin corresponding to the redshifted 
line emission, one can consider the correlation as a function 
of the redshift bin and use that information to establish the 
redshift dependence of the high-redshift SZ signal as well as 
the IRB component related to primordial galaxies. Here, we 
discuss the measurement of the proposed cross-correlations 
using CMB maps from upcoming missions and maps of the 
IRB from wide-field images. A cross-correlation of the 21-cm 
signal with galaxy surveys has been discussed in Furlanetto 
& Lidz (2006) and Wyithe & Loeb (2006). 

In this paper we use a simple analytic model to cal- 
culate and understand the qualitative features of the cross- 
correlation. For more accurate quantitative predictions, one 
should use the numerical simulations, which have recently 
made significant advances (Iliev et al. 2006a; Mellema et al. 
2006; Iliev et al. 2006b; Kohler et al. 2005; Zahn et al. 2006). 

This paper is organized as following. In Section 2, we 
present our basic model for the auto- and cross-correlations 
of various fields discussed here. In Section 3, we consider a 
physical model of reionization and discuss our results and 
observability of various statistics. Finally, we conclude with 
a summary of our results in Section 5. 



2 METHOD 

In this Section, we develop simple models for 21cm, high- 
redshift SZ, and IR backgrounds and their cross-correlation. 
To describe the 21-cm background, we closely follow the 
formalism developed in Zaldarriaga et al. (2004) and extend 
it to encompass the IR and SZ fluctuations as well. The basic 
starting premise is that the 21-cm fluctuations are tracing 
the dimensionless brightness temperature field 

i>(n,r) = (l + 8)x H , (1) 

where S is the fraction over-density and xh is the neu- 
tral fraction, while the IR and SZ fluctuations trace the 
"sources" field 

4>{h,r) = {l + S a ). (2) 

If we assume that during reionization, the early sources 
form bubbles of nearly completely ionised gas so that xh is 
strongly bimodal and that this gas is everywhere hot enough 
to produce SZ fluctuations and that it is filled with pri- 
mordial sources emitting in (observed-frame) infrared light, 
then it is reasonable to expect that 4>(h, r) oc (1 + 8)xs with 
xs — 1—xh- Here we take a marginally more realistic model, 
by assuming that sources occupy just some central fraction 
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we can define the angular power spectrum 

{ai m a£i m i) = 8ui8 mm iC" n , (7) 

so that 



where 



(8) 



A^(fe) = j^P^ik) = ^5 D (k + k') <^(k)^(k')> (9) 
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Figure 1. This figure shows schematically, the neutral fraction 
field xh (top left), the "negative" neutral fraction field (top right) 
and the sources field xg (bottom) used in this paper. 



of the reionization halos as illustrated in the Figure 1. To 
what extend this is a plausible approximation remains to be 
seen, but it is a useful starting point for the cross-correlation 
study we propose here. 

We will use superscripts n, i, y to describe the 21-cm 
spin flip transition, i for the IR signal and y for the SZ signal 
from the epoch of reionization. For example, C™ 1 denotes the 
angular cross-correlation power spectrum between the 21-cm 
and the IR signals. We will adopt units of mK for th 21-cm 
signal, nW/m 2 /sr for the IR signal and the observed fj>K 
decrement in the Rayleigh- Jeans region for the Comptoniza- 
tion parameter (y). The latter is given by AT = — 2TcmbJ/- 

Through the paper we use a standard flat cosmology 
with n b = 0.05, On = 0.3, fl A = 0.7, cr 8 = 0.8 and n s = 1, 
where the symbols have their standard meaning. 

2.1 21cm background and power spectrum 

In the limit of no redshift distortions and spin temperature 
of hydrogen being much larger than the CMB temperature, 
the observed brightness temperature on the sky is given by 
(Zaldarriaga et al. 2004) 



ft) = T„(r ) J 



drW(r)ip(h, r), 



(3) 



were W(r) is the window function describing the instrumen- 
tal bandwidth around some target redshift r , normalized 
such that J drW(r) — 1 and 



T (r) = 23mK(i^ (^-) ' 
v ; V0.02/ V0.15/ 
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with ip being defined in equation (1) 
If we expand ip in Fourier series 

d 3 k 



iKx) = J ■ 



V"fe(k)e l 



(4) 



(5) 



(2tt)3 

and then use the spherical harmonic decomposition of the 
observed temperature 



dim — 4™ / 7 ^Vfc(k)o i (fc,i/)y/ m (k) 
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(k,v)=T {r ) j drW(r)jt(kr), 



(6) 



2.2 IR background and power spectrum 

We now repeat the exercise for the IR flux from the first 
stars. The specific intensity is given by 



with 



dl v 
dr 



T(n)=|dr^(n,r), (10) 
1 dV. r ., s . I*(i/(l + z))(l + z) 



r) dr 



#(r)t* 
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(11) 



where <j>{r) is the average value of 4> (defined in equation 
(2)) at a given distance from the observer. Other quantities 
take care of producing the correct mean intensity: $ is the 
star- formation rate per unit volume at a given redshift, t* ~ 
5 x 10 6 yr, Z* is the mean spectrum per unit mass of a Pop III 
star and Di is the luminosity distance. We take the spectrum 
of first stars from Santos et al. (2002) assuming unity escape 
fraction. Assuming a different escape fraction would change 
our results, but quantitatively the would remain the same. 

In Figure 2, we plot intensity spectra for a 100 Mq mass 
star at redshift of z — 15, but observed today. Note the pro- 
nounced (even with a logarithmic scale!) Lyman-alpha peak 
(due to collisionally excited atoms, see Santos et al. (2002)) 
at around few microns in wavelength. At lower wavelengths, 
this emission dominates the total intensity signal from such 
a star. This means that if our observing wavelength is such 
that the integral includes a contribution from the peak, then 
the resulting intensity power spectrum would be dominated 
by this emission. 

If we expand <f> in Fourier series and use the spheri- 
cal harmonic decomposition in exact analogy with equations 
(5)-(8) we can define the angular power spectrum given by 

C?=4*J™Al4(k)fi(k), (12) 



where 



Pt{k 



,u) = J dr 



dr 



■jt{kr) 



(13) 



and 



= ^P**(k) = |^(k + k')<<Mk)<Mk')> 



(14) 



2.3 High-redshift SZ component 



The high-redshift SZ component is handled in a very similar 
manner. We write the total Comptonization parameter as 

dy 



dr^-d>(h, r) . 
dr 



(15) 
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Figure 2. The observed spectrum of 100 solar mass primordial 
Pop-Ill star at rcdshift of z = 15 as a function of the observed 
wavelength. The solid line is for unity escape fraction and the 
dotted line is for zero escape fraction - see (Santos et al. 2002) 
for details. 



To calculate the Comptonization parameter, we model 
each supernovae as a spherical ball of radius R. The 
mean Comptonization parameter per SN remnant is y = 
kbT e / (m e c 2 )aTn e 4:R/3 (ot is the Thomson scattering cross- 
section, m e is the electron mass) and each remnant con- 
tributes over ■k(R/Da) 2 per steradian to mean y. Assuming 
that SN energy E Bn ~ 10 44 J is evenly distributed between 
electrons (E sn = N e kbT e ) one arrives at 



1 dV 



dy ^ 

dr (j>(r) dr 



m e c z 



(16) 



where M prog ~ 5OM is a typical progenitor mass, and J7 sn ~ 
0.5 — 1 takes into account the fact that not all stars end with 
a supernovae and that not all energy is deposited into hot 
electrons. 

We can we can now define the angular power spectrum 
given by 



where 



Cf =4 7 r|^A^(fc) 7 , 2 (fc), (17) 



2.4 The cross-correlation power spectrum 

When cross-correlating with the 21-cm, the cross-correlation 
power spectra are now defined in exactly the same manner: 



CT = 4tt | ^A^(fc)a £ (fc)/3/(fc) 
C?" = 47V J ^-A 2 ^(k)a e (k) 7e (k) 



4tt 



J k 



Al4k)f3 e (kh e (k) 



(19) 
(20) 
(21) 



where 

A^W = ^P^(k) = ^5 D (k + k') <^(k)0(k')> , (22) 

is the three-dimensional cross-correlation between 21-cm 
brightness temperature and source fields. In above, indices 
i, n and y represent anisotropics in the IR, 21-cm brightness 
temperature (neutral Hydrogen), and the CMB (due to SZ 
y-parameter) , respectively. 



3 CROSS-CORRELATION CALCULATION 

To proceed, we need to produce a model of correlations of 
the neutral fraction. If we assume that a) bubbles are un- 
corrected and b) that any pocket of gas is either completely 
ionised or completely neutral, then a good model is (Zaldar- 
riaga et al. 2004) (see also Furlanetto et al. 2004) 



(xh{x)xh(x!)) = x 2 H + (x H - x 2 H )f(r/R b ), 



(23) 



where r = |x — x'|, R is typical (or effective) bubble size, 
f(x) goes to unity at zero and to zero at infinity. So de- 
fined auto-correlation function has the desired property that 
(xh(x-)xh{x')) — > x 2 H as r — > oo (bubbles are uncorrelated!) 
and (xh(x)xh{x')) — ► xh as r — > (because xh is either 1 
or 0). 

Similarly, the auto-correlation of the xs field satisfies 

(^(x)xs(x')) = x% + (xs - x 2 s )f(r/R s ), (24) 

with the condition that xs = (1 — xu)(Rs/Rb) 3 (see Figure 
1). To calculate the cross-correlation correctly, one would 
need to calculate it given function form of f(x). Here we 
approximate it with one that has correct limits, i.e. 



(ih(x)is(x')) = (x s x H ) [1 - f(r/R bs )] . 



(25) 



In this calculation we have simply assumed a constant bub- 
ble size of Rb = 4Mpc/h with sources size R s = 2Mpc/h 
and the cross- correlation size R bs = 2.25Mpc/h. 

We can now calculate auto and cross correlators of i\> 
and 4>- These are given by 

(V(x)V(x')) - (i>f = [x 2 H + (x H - x 2 H )f(r/R b )] C(r)+ 
(x H - x 2 H )f(r/R b ) + ■q H (r)[2x H + Vh(v)] (26) 

<<Kx)«Mx')) - {<Pf = [x% + (xs - x 2 s )f(r/R s )] C(r)+ 

(xs - x 2 s )f(r/R s ) + 775(r)[2i s + r,s(r)] (27) 



je(k) 



I 



dr ^l je (kr) 
dr 



(18) 



<V(x)0(x')> - (0) (V) = [(x H xs)(l ~ f(r/R bs )] C(r)- 
(x H xs)f(r/R bs ) + x H ris(r) + xsr)H(r) + nHns(r) (28) 
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where r/H.s is the cross-correlation between neutral fraction 
and density ijH,s(r) = (<5(x)xjj,s(x')) and £ is the density 
contrast correlation function f(r) = (<5(x)<5(x')). 

Assuming ij — 0, we can now Fourier transform these 
to obtain the relevant power spectra: 



r P(fc) 



x 2 H )(P fs (k) + Pf(k)) 



*W(fc) = (xsfP(k) + (x s - x 2 s )(P fs (k) + P f (k)) 
P**{k) = (x„xs)(P(k) - P f s(k) - P f (k)), 



(29) 
(30) 
(31) 
(32) 



where Pf(k) is the Fourier transform of /(r), using the ap- 
propriate radius Rb, R s or Rb s and P/s is the Fourier trans- 
form of /£. 

Finally, we take the highly biased nature of primordial 
sources into account heuristically by using a biasing factor. 
The mean bias is given by 



b = 



dMdN/dMMb(M, z) 
J™ . dMdN/dMM 



(33) 



We multiply the linear power spectrum by b 2 for IR and SZ 
and their cross-correlation spectra and by the b(z) for the 
cross power spectra of 21cm with either IR or SZ signals. A 
technically more correct way of performing this calculation 
would be to correctly take into account the cross-correlation 
field rj. 

We use the linear bias function (we take the simple 
linear bias of Mo & White (1996)) and M m i n is the min- 
imal mass corresponding to halos of virial temperature of 
2 x 10 4 K. 



3.1 Star-formation history and reionization 

To complete our calculation, we need to also specify the star- 
formation rate and reionization history. In early universe, 
massive enough halos accrete and cool gas to form stars. A 
halo of mass M converts a fraction of /*/b of its mass into 
first massive stars that later explode as supernovae, where 
/* ~ 0.2 is the fraction of baryons that fragment to form 
stars and fb = Ob/fi m ~ 0.2 is the baryon fraction. 

The total density in stars at any redshift is given by 



P*0) 



f 

J M 



Pi ,(M,z)dM 



F 

J M 



hf.M^dM, (34) 



where dN/dM is the number density of halos per unit mass 
range. We take the Press-Schechter (Press & Schechter 1974) 
form of the number counts. The star-formation rate per co- 
moving volume for halos with mass between M and M + dM 
is then simply 



tt(M,*) = lp*(M,z) = ^p*(M,z) (35) 



with 

^ = H {1 + z) (fi m (l + zf + Q k (l + zf + Q A ) 1/2 (36) 

We can also make a very simple model that connects 
the reionized fraction and star-formation activity. If we take 
a steady-state approximation and assume that q ~ 4000 
ionizing photons are emitted per baryon in stars, then the 
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Figure 3. The neutral fraction (xh) as a function of redshift 
as estimated by the equation (37). The dashed line shows the 
4x[i (1 — xh) - a quantity that is directly proportional to ampli- 
tude of cross-correlation power spectra. 



ionised fraction is simply given by 

qt ICC dp* 



1 — xh = min 



(37) 



fibperit d£ 

where the recombination time scale is given by Madau et al. 
(1999) 



where we assumed the clumping factor C 



("HIl)/("HIl) 



50. The resulting neutral fraction is 



plotted in the Figure 3. 



3.2 Approximations 

Two common approximations are used in the calculation 
of power spectra, depending on the ratio of survey depth 
compared to the angular scales of interest. If 5r /(Da^ 1 ) <C 
1, then window function can be approximated as a delta 
function and the equation (8) reduces to 



C7 



= 4 7 rT (ro) 2 J ^-A 2 ^(k)j 2 e (kr ), (39) 

and equivalent for equations (12) and (19). 

The other limit is when 6r/(DA£~ 1 ) 3> 11 one can use 
the so-called Limber approximation: 



/ 



dkF(k)je(kn)je(kr 2 ) 



2r\ 



S(n - r 2 )F(e/n) (40) 



The necessary condition is that F(k) is a smoothly varying 
function. When applied to the power spectra, this results in 
equation (8) simplifying to 

CT = J dr(T (r)W(r)) 2 P(e/d A )r- 2 . (41) 
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If additionally 8r <C r, one can simply further to 

Cr = T^r)P(£/d A )r- 2 (8r)-\ (42) 

where 8r is the effective width of the windows function W(r). 
Analogous expressions hold for other equations for cross- 
correlation spectra are: 

CT = T (r)-^-P(£/d A )r~ 2 , (43) 
dr(r) 

C? V =T (r)-^_P(£/d A )r~\ (44) 

The interesting point here is that the auto-correlation 
power spectrum in units of temperature scales inversely with 
bandwidth (Sr), while the cross-correlation power spectra 
are, in the Limber limit, to first order independent of it. 

In Figure 4, we plot a sample spectrum using the full for- 
mula (8) and the two approximations in equations (39) and 
(42). We obtain the expected result: for narrow bandwidths 
and large angular scales the delta function approximation 
fares better, but for bigger bandwidths the Limber approxi- 
mation is more appropriate. The unexpected discrepancy at 
large bandwidths and small scales is likely due to ringing 
of the window function, which does not seem to cancel out 
completely as assumed in the Limber approximation. 



4 RESULTS 

The auto-correlation power spectra of our model are compa- 
rable with published results. We plot some of them, together 
with some published data in the Figure 5. The observational 
uncertainties in the parameters of our model give an overall 
uncertainty in the amplitude of the signal of about order 
of magnitude. The mean Comptonization parameter in our 
model is y — 5 x 10~ 6 , which is compatible with the COBE- 
FIRAS upper limit around at the same level (Mather et al. 
1994). The 2/im mean IR flux from first sources during reion- 
ization ~ 5 nW m~ 2 sr -1 an 2/im, which is within exper- 
imental upper limits on the IRB intensity (about 20 to 70 
nW mT 2 sr -1 ) at those wavelengths (Cambresy et al. 2001). 

In Figure 6 we plot the relative cross-correlation coef- 
ficient CF/y/CpC?* 1 for the cross- correlation of the 21cm 
and the IR signals. We naturally expect a negative cross- 
correlation between the red-shifted 21-cm signal and the in- 
frared light from the stars. Even in our simplified model this 
anti-correlation does not hold on large scales where the sig- 
nal is dominated by the density perturbations that correlate 
positively. The cross-over scale correspond to the typical size 
of the bubbles. Therefore, we generically predict that IR and 
21-cm line signals to be correlated on large scale and anti- 
correlated on small scales. The cross-over scale can be used 
to determine the typical size of the bubbles. Our calculations 
are probably accurate to only an order of magnitude, but the 
heuristics should nevertheless be robust. Secondly, we note 
that when small enough (~ 2/im) wavelength is used, then 
the IR signal is dominated by a rather narrow redshift range 
and the cross-correlation will be picked only when 21-cm 
observation are tuned to the same redshift range. When ob- 
serving the continuum radiation at larger wavelengths, the 
cross-correlation is much less pronounced, but present at all 
redshifts. In Figure 7 we plot the same for cross-correlating 
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Figure 4. The power spectra calculated for the reionization and 
bubble model used in this paper at z = 10 using three different 
approximation schemes discussed in the text. Solid line is the 
full calculation of the equation (8), dashed is the delta- function 
approximation of equation (39) and dotted line is the Limber 
approximation of equation (42). The bandwidth of 0.1 MHz was 
used for the top graph and 10MHz for the bottom one. The dot- 
dashed lines denote the expected A noise CV for LOFAR and SKA 
experiments. 



the 21-cm signal with the SZ signal. We note that this graphs 
looks suspiciously similar to the right panel of Figure 6. 

These features are easy to understand, if we look at 
the "source" functions plotted in Figure 8. There we plot 
the functions which are proportional to the amplitude of 
the signal given the same power spectrum. We note that 
particularly for the case of the IR signal, we can clearly 
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Figure 6. The cross-correlation power spectra for 21-cm anisotropy at redshifts between 8 and 19 correlated against IRB observed at 
wavelengths of 1.94/im (left panel; frequency chosen so that peak occurs at integer redshift, see also Figure 8) and Afim (right panel). 
The bandwidth of the 21-cm signal was set to 10MHz. The redshift of maximum amplitude is 15 for 1.94/im and 11 for for Afim. 



see the features of the spectrum plotted in the Figure 2: 
If we choose to observe the spectrum at large wavelengths, 
the spectrum is essentially featureless, while if we choose to 
observe at ~ 2/im we get the pronounced spike at z ~ 15. It 
is clear from this graph, the the maximum cross-correlation 
for either SZ or IR signal at 4pim should happen at z ~ 11, 
while the IR correlation is nearly completely sourced at z ~ 
15. 



4.1 Observability 

Is the signal large enough to be realistically observed by any 
of the future experiments? 

There are two sources of noise present in any attempt to 
detect cross-correlation between the two fields: the intrinsic 
noise properties of instruments used and the sample variance 
that limits our ability to detect correlations due to a finite 
solid angle of the sky being observed. 

The noise power spectrum for the planned future radio 
telescopes can be, under some simplifying assumptions be 
written as (Zaldarriaga et al. 2004) 

e 2 cr isc = r s 2 ys (2^) 2 / i \ 

2-K Svtintf cover \i max / 

where T sys is the system temperature, 5v is the experiment 
bandwidth, 5v is the experiment bandwidth, tint is the in- 
tegration time and / CO ver is the covering fraction. The error 
on the estimated power spectrum is then given by 

P ■ 

A noise /,»» ^noisc ^min / a/^\ 

C e ~ C t —j- (46) 

Very similar consideration hold for IR and SZ experi- 



ments. The noise power spectrum is given by 

Cr is ° = 4^/ Sky -^, (47) 

i Vpix 

where / s k y is the fraction of sky observed (or being observed 
by both experiments for cross-correlation studies), -/V p i x is 
the number of pixel and a is noise per pixel. The error on 
the estimated power spectrum is then given by 

A^C e = f^^fr™ (48) 

In the Table 1 we list the expected sensitivities for a 
few experiments. We discuss the following experiments in 
this paper: Low Frequency Array (LOFAR; Salter et al. 
2004) 1 , the Square Kilometer Array (SKA; Carilli & Rawl- 
ings 2004) 2 , CIBER (Bock et al. 2006) 3 , AKARI (Mat- 
suhara et al. 2006) 4 and PLANCK (The Planck Collabo- 
ration 2006) 5 . 

In Figure 9, we plot predictions for cross-correlation 
power spectra and the instrumental sensitivities. 

However, sample variance is not necessarily a negligible 
problem in cross-correlation studies or cases with small / a k y : 
only a small part of the IR or SZ light cross-correlates with 
the relatively narrow-band 21-cm light and so the rest of the 
IR signal acts as a source of noise. As a crude approximation, 
we can assume that the skies are normally distributed and 



1 http://www.lofar.org 

2 http://www.skatelescope.org 

3 http://physics.ucsd.edu/ bkeating/CIBER.html 

4 http : //www . ir . isas . j axa . j p/ASTRO-F/Outr each/index_e . html 

5 http : //www . r ssd . esa . int /index . php?pr o j ect=Planck 
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Experiment 


Type 


t ■ - 


^raax 


/sky 


A noisc c e e 2 /2TT nte = io 3 


LOFAR 


21-cm 


~ 40 - 


- 10 4 


~ 0.5 


0.2mK 2 


SKA 


21-cm 


~ 40 - 


- 10 4 


~ 0.5 


0.002mK 2 


CIBER 


IR 


~ 10 3 - 


~ 10 4 


4 x 10" 4 


0.004 (nW/m 2 /sr) 


AKARI 


IR 


~ 2 - 5 


~ 10 5 


1 


3 X 10~ 4 (nW/m 2 /sr) 


Planck 


SZ 


~ 2 - ~ 


6000 


~ 1 


1 M K 2 



Table 1. This table lists the expected sensitivities for two 21-cm and one IR and CMB experiments. For 21-cm signal we assumed one 
month observation and 10MHz bandwidth (taken from (Zaldarriaga ct al. 2004)). For the rocket IR experiment, we assumed 1.8nW/m 2 /sr 
per pixel with pixel size 17" and 16 square degrees of observation and for the fictional AKARI FS (full sky) experiment we assumed noise 
of 11 /ijy, beam size of 1.46 arc mins, corresponding to the NEP (North Ecliptic Pole) survey, but expanding it from 6 square degrees 
to full sky. For Planck experiment we assumed full sky observation at 10 arcmin resolution and noise per pixel of 5 fiK 2 . 






Figure 9. This figure shows the same cross-correlations as in the Figures 6 and 7, but in absolute units. Left and middle panels are for 21- 
cm -IR cross correlation at 1.94/um and 4^tm respectively, the right panel is for 21-cm-SZ cross-correlation. The expected sensitivities for 
experiments in Table 1 are also plotted as dashed lines (outer contours are for cross-correlations with LOFAR, inner for cross-correlations 
with SKA. 



so the sample variance errors are given by 

Vi^&T'- (49 ' 

for £ ^> 2 and the dummy indices x and y are to be re- 
placed with n (21-cm signal), i (IR signal) or y (SZ signal). 
For / sky = 1 the sample variance is often termed cosmic 
variance. 

This justifies our choice of bandwidth to be 10 MHz. 
When entering the Limber regime, the 21cm signal drops in 
amplitude, while the cross-correlation signal stays, to first 
order, the same. Hence, the cross-correlation is easier to de- 
tect if we use wide-bandwidth experiment (or bin the data 
in a suitable manner). In Figures 6 and 7 we plotted the 
cosmic-variance limits as a dashed line. At 10MHz they do 
no seem to be a problem, but they might be at smaller band- 
widths. One must also note that these error-bars increase 
with / sky , where / s ky is the fraction of sky being observed 
and so this might be a serious problem for experiments that 
observe only small patches of the sky. Note, however, that 
these lines correspond to errors per single I: if one combines 
a range of i values as is usually the case, the error drops 
correspondingly. 

The total error is just the of sample variance error and 
the noise error added in quadrature: 



(A tot C £ ) 2 = (A SV C £ ) 2 + (A noiso C £ ) 2 (50) 

Which source of error is dominant? In the Figure 10 we 
plot the ratio of sample variance to noise error on Ct for 
various auto- and cross- spectra. We note that the sample 
variance is important in most of the experiments discussed 
here. This implies that increasing the fraction of the ob- 
served sky will improve constraints. 

The overall signal-to-noise ratio (SNR) for a given ex- 
periment or pair of experiments can be simply calculated 
by 

where index I runs over the multipole values for which a 
given experiment (or both in case of cross-correlation) has 
coverage. Assuming values from Table 1, we have estimated 
the total SNR for various experiments. These are given in 
the Table 2. Note that we chosen z = 14 as the target 21-cm 
redshift. If we chose the redshift corresponding to the peak 
of the Pop-Ill spectrum, the SNR values would have been 
much more favorable. 
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Figure 5. The auto-correlation power spectra contribution from 
early stars for IR (top) and SZ (bottom). For IR contribution we 
show the predicted power spectra, the data-points from Kashlin- 
sky with shot-noise contribution subtracted (for details see Sul- 
livan et al. 2006) for details). Straight dot-dashed lines are the 
expected A nolse Cf for the CIBER (top; see (top; for details see 
Bock et al. 2006) and AKARI (for details see Matsuhara et al. 
2006) experiments. For SZ contribution we plot the predicted 
auto-correlation power spectrum, the data-points from CBI (red), 
ACBAR (blue) (reseated by 4-3 to account for different observing 
frequency of ACBAR assuming that the signal is dominated by 
the SZ effect) and BIMA (green). The straight dot-dashed line is 
the expected A noisc C£ for the PLANCK experiment. 
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Figure 7. The same as figure 6 but for cross-correlating 21-cm 
with Comptonization parameter expressed in units of tempera- 
ture decrement in the Rayleigh-Jeans region of the spectrum. 




2 



Figure 8. This figure shows the functions that source auto- and 
cross- correlation power spectra as a function of redshift. We plot 
To?/) (solid line), d/„/dr0 (for u ^, s = 2fim which has spike at 
z ~ 15 and i/ t,s = 4/xm; dotted) and dy/drij>. All curves were 
normalized to unity at maximum. 

4.2 Recovering xh{z) 

From what it was said above, it is clear that there is hope for 
reconstruction reionization history using cross-correlations 
discussed here. In Figure 11 we plot the cross-correlation as 
a function of redshift at I = 6000 for a number of observing 
wavelengths. The same plot for SZ-21cm correlations would 
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Figure 10. This figure shows the ratio of sample variance to 
noise error per Cg,. The solid lines are for auto-correlations with 
experiments under consideration are given by line weight: LOFAR 
(thickest), SKA, CIBER, AKARI, Planck (thinnest). The dotted 
line are for cross-correlations and we plot the following combina- 
tions: LOFAR x CIBER (thickest), SKA x AKARI, LOFAR x 
Planck, SKA X AKARI Planck (thinnest). 



Experiment 


SNR 


LOFAR x LOFAR 


~ 250 


SKA x SKA 


~ 2000 


CIBER x CIBER 


~ 1000 


AKARI FS x AKARI FS 


~ 4 x 10 4 


Planck X Planck 


~ 800 


LOFAR x CIBER 


~ 40 


SKA x AKARI FS 


~ 1300 


LOFAR x Planck 


~ 36 


SKA x Planck 


~ 137 



Table 2. This table lists the expected signal-to-noise ratios for 
various experiments discussed, assuming the target redshift of 
z = 14 and observing wavelength ~ 2/im. 



qualitatively very similar to that of the case of observing at 
4/xm or 10 /im. In the same plot we also plot the value of 
x H (l-x H )(z/ll) 2 . 

It is clear from the plot that one can, if one observes at 
the right wavelength, in principle constrain xh(1 — xh) and 
therefore the neutral fraction as a function of time. We will 
not go into details of feasibility as there are many theoretical 
details that needs to be worked out, in particular the details 
of star- formation, spectral features, bubble size, etc.: diffi- 
cult problems, which should be nonetheless solved by more 
detailed modeling. We also note, that there are several spec- 
tra: many degeneracies can be broken by, for example, using 
auto-correlation spectra to recover star-formation histories, 
etc. 




z 



Figure 11. This figure shows the cross-correlation power spec- 
tra at i = 6000 as function of target for IR observing wave- 
lengths of 1 (im (solid), 2 (im (dotted), 4 [an (dashed) and 10 
(im (long dashed). The thick green dotted line in the plot of 
xk(1 -x h )(z/11) 2 



5 DISCUSSION AND CONCLUSIONS 

In this paper we have presented a simple model for auto- 
and cross-correlations of the 21-cm, IR and SZ signals from 
the first stars. In particular, the cross-correlations of 21-cm 
signal with either IR or SZ signals as a tool to measure 
reionization history was introduced for the first time. 

The cross-correlation power spectra have a distinct 
shape: on scales smaller than the bubble size the anti- 
correlate, because stars and supernovae glow in negative 
with respect to the 21-cm sky. On large scales, they both 
trace density fluctuations and correlate positively. 

Cross-correlations have the usual advantage over auto- 
correlations in that foregrounds automatically cancel, un- 
less foregrounds for two signals also cross-correlate, which 
is unlikely due to a very different nature of foregrounds 
for the signals discussed in this paper. A realistic possibil- 
ity are radio-galaxy /AGN that can bee seen both in radio 
through synchrotron emission and in the infrared by the 
dust emission. Di Matteo et al. (2004) have argued that if 
bright sources are removed above flux levels of S > O.lmJy 
the auto-correlation power spectra should be uncontam- 
inated for £ < 10 4 and only contaminated at O(l) for 
I > 10 4 . One could argue that something similar holds 
for the cross-correlation. If significant, one could model this 
cross-correlation and add it to the model: note that the sign 
of cross-correlation is different on small scales and hence 
this cannot produce a false detection. Finally, one could re- 
sort to using spectral cleaning techniques (Morales et al. 
2006), although this would defy one of the main advantages 
of cross-correlation. 

We have shown that the tomography of cross- 
correlations of 21-cm signal with the IR and SZ signal has, 
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in principle, potential to strongly constrain the reionization, 
especially in the region when xh ~ 0.5. When observed at 
smaller IR wavelengths, the cross-correlation actually traces 
the spectrum of the primordial stars and strong features, 
such as Lyman-alpha peak could be clearly reconstructed 
from the data. 

We have roughly modeled the noise properties of several 
up-coming experiments. Detection properties of experiments 
to detect cross-correlations is often limited by a limited frac- 
tion of sky observed with the first-generation of experiments. 
Nevertheless, we predict that detection is within the reach 
of the next-generation experiments. The most problematic 
at the moment is the IR field: the early experiments such 
as CIBER or the AKARI NEP (North Ecliptic Pole) sur- 
vey are simply limited by a small fraction of the sky being 
observed. A full-sky map at the sensitivity of AKARI NEP, 
will be clearly able to detect the cross-correlation. We note, 
that the signal-to-noise ratios given in the Table 2 are for a 
single 21-cm signal cross-correlation: stacking many of them 
will, of course, improve the signal-to-noise ratio accordingly. 

Finally, we note that our model is overly simplistic 
in many aspects. Firstly, we assume a continuous Pop-Ill 
star-formation together with continuous production of mas- 
sive supernovae. In reality, however, massive supernovae will 
pollute their environment with metals and hence quench 
the Pop-Ill star formation, leading to a fast transition to 
the Pop-II stars. Alternatively, supernovae are not so en- 
ergetic and Pop-Ill stars can actually completely reionize 
the Universe. In the former case, our high-redshift calcula- 
tions still hold, but at lower redshifts both IR and SZ cross- 
correlations disappear as both Pop-II star-light and the cor- 
responding supernovae are much less energetic. In the latter 
case, IR correlates with 21-cm throughout the duration of 
reionization, but we overestimated cross-correlation with SZ. 
In either case, the cross-correlations can be used to constrain 
reionization process. Secondly, the constant bubble-size used 
here cannot hold. In reality, the bubble size increases and 
gets to the level of few Mpc at the peak of the reionization 
process, when xh ~ 0.5 and the bubbles have not merged 
yet. Latter the bubbles merge and the bubble-size is not a 
well-defined quantity. In this work we used 4 h^ 1 Mpc for 
a typical size of bubble and 2h~ 1 Mpc for the typical size 
of sources. Lowering the size of sources pushes the auto- 
correlation and cross-correlation spectra up. However, the 
relative amplitudes decreases and hence the prospects for 
detection decrease. Thirdly, the cross-correlation terms be- 
tween the neutral fraction and the density were ignored in 
this work, following the analysis in Zaldarriaga et al. (2004). 
While these terms might be important, it is not immedi- 
ately clear what would their effect be. An analytical model 
for cross-correlation has been developed in Furlanetto et al. 
(2004) and the study of this effect is deferred to a forthcom- 
ing publication. 

In this paper we have put forward a novel idea for ex- 
tracting information of reionization. Adding details to the 
model, such as bubble size growth, neutral fraction - density 
correlation, etc., either through a more accurate analytical 
modeling, but probably more likely through insight from nu- 
merical codes, will probably improve and somewhat change 
predictions presented here, but hopefully only at the quan- 
titative level. 
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